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Abstract. In competing risks models, the joint distribution of the event 
times is not identifiable even when the margins are fully known, which 
has been referred to as the "identifiability crisis in competing risks analy- 
sis" (Crowder, 1991). We model the dependence between the event times 
by an unknown copula and show that identification is actually possible 
within many frequently used families of copulas. The result is then ex- 
tended to the case where one margin is unknown. 
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1. Introduction 

The theory of competing risks is concerned with the analysis of multiple possible 
causes of a certain event ("risk") in a system of interest. As an example, consider 
an animal experiment in which mice die either from a disease (event time X) or 
from the side effects of some treatment (event time Y). In this setting, one observes 
only the minimum of the two event times, corresponding to the time of death, and 
a variable indicating the cause of death, that is 

Z:=XAY and A:=1 x <y- (1-1) 

In this paper, we investigate under which conditions the joint distribution of (X, Y) 
can be identified based on the observations. Of course, in practice one could consider 
many more potential risks even in this simple example, but in this paper we treat 
the bivariate case exclusively. The focus lies in particular on the case where there 
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is dependence between the two risks X and Y, which is modelled by a bivariate 
copula C := Cxy ■ Indeed, the assumption of independence between X and Y seems 
unrealistic in a practical context. In the animal experiment example, one might 
suspect a positive correlation between the two risks, because it is conceivable that 
the vulnerability to the treatment's side effects depends on the course of the disease. 

The identifiability of joint distributions in competing risks models is not a recent 
topic. Already in the 1950s, Cox (1959) pointed out an identifiability problem in 
the bivariate case for independent risks, and Tsiatis (1975) showed in the general 
multivariate case that the joint distribution of the failure times cannot be identified 
by their minimum. Crowder (1991) showed that "the situation is even worse than 
previously described" - even when the marginal laws of X and Y are known, the 
joint distribution function is not identifiable. This is a very undesirable property of 
the model, and so Crowder called out the "identifiability crisis in competing risks 
analysis". 

Tsiatis' observations have been the starting point for investigations on conditions 
or modifications of the model which allow for identification and estimation of the 
event time distribution, and a variety of such models has been studied over the past 
decades. In order to obtain identifiability of the joint distribution of X and Y, one 
has to restrict the class of possible models. For example, one can exclude the inde- 
pendent case and restrict the class of admissible dependence structures. Basu and 
Ghosh (1978) follow this approach and show identifiability of the bivariate normal 
and the bivariate exponential distributions introduced in Marshall and Olkin (1967) 
and Gumbel (1960). Instead of fixing the precise dependence structure of the joint 
distribution in advance, Slud and Rubinstein (1983) suppose that a certain hazard 
ratio involving the event and the censoring time is known. Under such assumptions, 
they derive pointwise bounds on the marginal survivor function. Ebrahimi et al. 
(2003) use still a different condition, involving a partial derivative of the conditional 
survivor function of the event time given that the censoring time is larger than a 
given threshold. 

Another approach which has been the topic of many research articles consists in 
modelling the dependence structure between the event and the censoring time using 
a copula. In view of Sklar's theorem, this kind of model allows for flexibility in the 
modelling of the dependence structure without affecting hypotheses on the margins. 
The identifiability of the copula and the margins can then be treated in two separate 
steps. 

Zheng and Klein (1995) and Rivest and Wells (2001) suppose that the event and 
censoring times are dependent via some known copula that is nowhere constant. They 
show that their marginal distributions are identifiable if their support is (0, oo) and 
develop the "copula graphic estimator". Klein and Moeschberger (1988) also work 
under the hypothesis of a known copula; more specifically, they choose a Clayton 
copula with known parameter. 

Of course, supposing the copula to be known is quite as unrealistic as supposing 
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the risks X and Y to be independent, but in view of the results in Crowder (1991) 
cited above, identification is impossible if the copula is completely unknown. It is 
therefore natural to ask if the copula can be identified within certain (parametric) 
classes. It is the scope of the present work to answer this question. To this end, we 
proceed as follows. 

In Section 2, we assume both of the marginal distributions to be entirely known 
and concentrate on the identification of the copula C which describes the depen- 
dence between the risks X and Y. Although the practical relevance of this setting is 
limited, its investigation is very instructive. We develop assumptions on the family 
of admissible copulas such that identification is possible based on the joint density 
of the observations, and we show that various parametric classes of copulas do actu- 
ally satisfy such conditions. In particular, making use of a recent result by Wysocki 
(2012), we prove that many well-known classes of Archimedean copulas are identifi- 
able. Certain classes of asymmetric copulas turn out to be identifiable as well. We 
also give examples of symmetric and asymmetric classes of copulas which cannot be 
identified. 

In Section 3, we treat the more realistic case where Fx is completely unknown. 
Not surprisingly, additional model assumptions are needed in order to obtain iden- 
tifiability in this more general setting. In particular, we assume that A and Z are 
stochastically independent. In the case where the copula Cxy is known, Braekers 
and Veraverbeke (2008) develop an estimator of Fx even when Fy is unknown. In 
contrast to this, we show that even if the copula is unknown, Fx can be identified if 
Fy is known. It is obviously impossible to consider all copulas. However, we will see 
that the same subclasses of Archimedean copulas as in Section 2 allow for identifica- 
tion. We will elaborate on the advantages and limitations of the model in Section 3 
and discuss its relation to the classical Koziol-Green model. 

The simultaneous identification of Fx and the copula in more general models 
remains an open question. Though the techniques developed in the present work 
will hopefully prove helpful in its solution, the identifiability crisis is not over yet 
and requires further investigation, as we finally discuss in Section 4. 

2. Known margins 

Throughout this section, suppose that the distributions Fx and Fy of event and 
censoring time X and Y are known. The observations {Z, A)sKx {0, 1} are given 
by (1.1). Assuming that the copula C is absolutely continuous with respect to the 
Lebesgue measure on the unit square with density c in the sense that 




we can write 
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C(F x (x),F Y (y)) 



F x {x) r F Y (y) 



P[X < x,Y < y] 
Consequently, 

P[Z > z,A = 1] = P[Y > X > z] 

lx=F x (z) Jy=F Y {F-\x)) 

and we obtain for the joint density h of (Z, A) that 

•1 g 



c(u, v)dvdu 



c(x, y)dydx 



h(z,0) = f Y (z) 
h(z,l) = f x (z) 



c(x,F Y (z))dx = f Y (z) - —C(F x (z),F Y (y)) 

x=f x {z) °y 

c(F x (z),y)dy = f x (z) - ^-C(F x (x), F Y (z)) 

y=F Y (z) OX 



y=z 



(2.1) 



Note that this implies the basic equality 

C(F x (z),F Y (z)) = F x {z) + F Y {z)-H(z) 



(2.2) 



where H denotes the distribution function of Z. From observation (2.1), one sees 
that two copulas can be distinguished based on the distribution of (Z, A) if and only 
if either of their partial derivatives do not coincide on the curve 



* = (F x (t),F Y (t))t&L. 



(2.3) 



This proves the following result. 



Theorem 2.1 Let X and Y be random variables with differentiable distribution func- 
tions F x and F Y , respectively, jointly distributed according to an unknown copula 
belonging to a class C of copulas having a density, and let (Z, A) be the observable 
random variables defined in (1.1). The class C is identifiable based on the joint dis- 
tribution of (Z, A) if and only if for any two different copulas C,C £ C there exists 
z € K such that 



d_ 
dx 



C(F x (x),F Y (z)) 



+ ^-C'(F x (x),F Y (z)) 







or —C(F x (z),F Y (y)) 



y=z 



+ ~C'(F x (z),F Y (y)) 
dy 



y=z 



This theorem provides us with a necessary and sufficient identifiability criterion for 
arbitrary classes of absolutely continuous copulas and margins admitting densities. 
In practice, this criterion will be difficult to verify for a given class, though. As 
noted, the class of all copulas is far too big to be identifiable as has been shown by 
Crowder (1991). In the remainder of the present section we will therefore restrict 
our attention to more particular classes of copulas and develop identification criteria 
that are easier to apply. 
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2.1. Symmetric copulas 

It follows from representation (2.1) that whenever Fx = F Y and in addition both first 
partial derivatives of C with respect to x and y coincide along the curve \I/ defined 
in (2.3), the joint density h does not depend on d, which happens for instance when 
the copula is symmetric. In this case, the curve '3/ is the diagonal of the unit square, 
and the density of Z takes the simpler form 

q 

h(z) = 2f Y (z) - 2—C(F Y (z), F Y (y))\ y=z (2.4) 

or, equivalently, 

S c (F Y (z)) := C(F Y (z),F Y (z)) = 2F Y (z) - H{z), (2.5) 

where H denotes the distribution function of Z and where 5c is called the diagonal 
section of C. 

Theorem 2.2 Suppose that the marginal distributions Fx and F Y are arbitrary but 
known. Then, the class of all symmetric copulas is not identifiable. 

Proof. Consider first the case Fx = F Y . Then the curve \£ defined in (2.3) is the 
diagonal of the unit square. In order to show the non-identifiability of the class 
of symmetric copulas, we construct two distinct symmetric copulas which coincide 
on a strip of positive width around the diagonal and which hence yield the same 
distribution H. To this end, define first functions from [0, l] 2 to {0, ±1} according 
to 

?(»,») M) :=sgu[(a-x)(t-y)] l{[(|s - x\ V \t - y\) < r/]} 

+ sgn[(s -y)(t- x)\ l{[(|s - y\ V \t - x\) < r?]}. 

The support of the function qj , ^ consists of two squares of side length 2r/ centred in 
(x,y) and (y,x), respectively. These squares in turn consist of four smaller squares 
on which Q? xy \ is constant +1 or —1. Now let C be a symmetric copula admitting 
a density c and let x,y,rj,e G (0, 1) be such that supp(g^,^) is in [0, l] 2 and does 
not intersect with a neighbourhood of the diagonal as illustrated in Figure 1 and 
c(s, t) > e for (s, t) G supp(g^, a). It is then easy to see that the function c + eq^ x ^ 
is a density yielding a symmetric copula C which coincides with C on a band of 
positive width around the diagonal. Thus, the two different symmetric copulas C 
and C give rise to the same distribution H according to (2.1), showing that the class 
of symmetric copulas is not identifiable. 

The general case Fx ^ F Y is straightforward applying the appropriate transfor- 
mations. □ 
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Figure 1: The support of the piecewise constant function cfl ^ consists of two squares 
that must not intersect with a neighbourhood of the diagonal 



2.2. Archimedean copulas 

We have seen that the class of all symmetric copulas is not identifiable. In this 
section, we restrict our attention further and consider only Archimedean copulas, 
that is copulas of the form 

C(x,y) = ^~ l \p{x) + ^{y)) 

for some generator function ip £ Q, where £1 denotes the set of decreasing convex 
functions from [0, 1] to [0, oo] with = 0. The pseudo-inverse c^[ -1 l of <p is given 

by 

hl] _ r v -i(t) if o < * < v(o) 

V [) ~\ ift>y>(0). 

In the case where cp is unbounded, the associated copula is called strict and we 
have Because of their nice properties, Archimedean copulas are very 

commonly used in the modelling of dependence and are therefore an interesting 
family of copulas to consider. 

Recent work by Wysocki (2012) on the construction of copulas from diagonal 
sections allows us to state an identification result for a subfamily of the class of 
Archimedean copulas in the special case where the margins are equal. 

Theorem 2.3 Suppose that the marginal distributions Fx and Fy are arbitrary but 
known. If Fx = Fy, then the subclass of strict Archimedean copulas generated by 

fii ;={ip € O I ip(0) = oo, ]hncp'(t) < 0} 

is identifiable. 



Proof. The generator of an Archimedean copula being uniquely determined up to a 
multiplicative constant, we may assume that lim^i <p'(t) = — 1 without loss of gen- 
erality. Lemma 1 in Wysocki (2012) states that under this assumption, the diagonal 
sections of two Archimedean copulas coincide (up to a multiplicative constant) if 
and only if their generators do so, that is, if the copulas are the same. As by virtue 
of (2.5), the diagonal section 5c of the copula can be identified when the margins 
are equal, this implies the identifiability of the whole copula. □ 

Theorem 2.3 implies identifiability for several well-known classes of Archimedean 
copulas when both margins are equal on the unit interval. Important examples of 
such classes can be found in Nelsen (2006, Table 4.1); in particular, the Frank 
copulas are identifiable. It is noteworthy that obviously even the union of all the 
identified subclasses of £li is identifiable. But the result is restricted to the special 
case of equal margins and to families of strict copulas. For example, the class of strict 
Clayton copulas is identifiable by virtue of the theorem, but some Clayton copulas 
are not strict {9 < 0). Also note that the theorem cannot be applied to every family 
of strict Archimedean copulas. For instance, the result does not apply to the Gumbel 
family, because (f'g(l) = for all 6 > 1. 

We have seen that Theorem 2.3 allows for identification within quite a large class 
of strict copulas, but only in the special case of equal margins. We still need a more 
general criterion for classes of strict copulas which are not covered by the theorem 
as well as for nonstrict copulas and general margins. 

Before stating the next result, let us recall that the level sets of a copula C are 
given by {(u,v) E [0, l] 2 | C(u,v) = t}. For an Archimedean copula and for t > 0, 
this level set consists of the points on the level curve <p(u) + <p(v) = <p(t). For t = 0, 
this curve is called zero curve; it is the boundary of the copula's zero set. All level 
curves of an Archimedean copula are convex. 

Theorem 2.4 Suppose that the marginal distributions Fx and Fy are arbitrary but 
known. Let := {<pg \ 6 G C M} C f2 be a family of generators and write Cg for the 
Archimedean copula generated by tpg. For z G [0, 1] denote by ( s (z) = (Cx{ z )> Cy ( z )) 
the intersection point of the copula Cg 's level curve of level z with the curve ^ given 
in (2.3). If for some z G [0, 1] at least one of the two coordinates of £ 6 (z) is strictly 
monotonic as a function from to [0, 1], then the class of copulas generated by $ is 
identifiable. 

Proof. Suppose without loss of generality that Cx(z) i s strictly monotonic in 0. The 
distribution function H of (Z, A) is obviously identifiable. It is thus sufficient to show 
that the true parameter can be computed based on the knowledge of H, which 
we therefore suppose to be known. Then, by virtue of (2.2) and using that both 
margins are known, the copula can be reconstructed completely along the path ^f. 
Consequently, the coordinate Cx{ z ) can ^ e computed in this case for any choice of z. 
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Figure 2: An Archimedean copula's zero curve and its intersection with the curve 
= (Fx(t), i ? y(t))tgR (on the left), and the special case Fx = Fy (on the 
right). The copula's zero set is shaded in grey. 

(See Figure 2 on the left for an illustration of the case z = 0.) The strict monotonicity 
in 9 ensures that Cx( z ) uniquely characterises 6 and thus Cq. □ 

Example 2.5 Let us consider the special case where the marginal distributions Fx 
and Fy are the same and strictly monotonic on their whole support. Let C be 
an Archimedean copula with generator <p E £1. It is easily seen that we have, for 
a? €(0,1], 

5 c (Fx(x)) = C(Fx(x),F x (x)) = z <=> ip^(2ip(F x (x))) = z 

<p(F x (x)) = ±<p(z) Fx(x) = ^(l^z)) =: Sc\z). 

For z = 0, the second equivalence is not true in general. Nevertheless, the same 
definition of 5^} can be used for z = as well, with the convention that <5^ 1 (0) = 
sup{x £ K | 5c(x) = 0}. The intersection point £(z) of the copula's level curve of 
level z with the diagonal can be expressed in terms of the inverse diagonal section, 
namely C,{z) = (5q 1 (z), 5q 1 (z)). A parametric class of Archimedean copulas {Cg} is 
thus identifiable if for some z, the inverse diagonal section 8q {z) is strictly monotonic 
as a function of 9. 

Choosing z = 1/2, this condition can be applied to the Gumbel family. Recall 
that the Gumbel family could not be treated with Theorem 2.3. If ip is bounded 
(i.e. if C is nonstrict), 5^ 1 (0) is strictly positive and corresponds to the intersection 
of the diagonal with the boundary of the copula's zero set, see Figure 2 on the 
right. Supposing that ip belongs to a parametric class, knowledge of 5^ 1 (0) is often 
sufficient in order to characterise C within the class. Within the whole Archimedean 
class, though, there exist different copulas having the same zero set (cf. Nelsen, 2006, 
p. 132), such that this criterion fails. 
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Theorem 2.4 has a corollary that provides us with an identifiability criterion in terms 
of the generator. 

Corollary 2.6 Let := {<p>g \ 9 € C M} C f2 be a family of bounded generators such 
that 9 i — y ipg(t)/(pg(0) is strictly monotonic in 9 for every t G (0,1). Suppose that 
the marginal distributions Fx and Fy are arbitrary but known. Then, the copulas 
generated by the class <3> are identifiable. 

Proof. Recall that a constant times a generator results in the same Archimedean 
copula. Thus, without loss of generality, we can consider the following class of 
generators instead of $ which generates the same copulas: $ = {(pg := {pg/{pg(0) \ 9 G 
C M} C 0. Note that by construction <p>g(0) = 1 for all 9 and <p>g(t) is strictly 
monotonic as a function of 8 for all t 6 (0, 1) (say strictly increasing, without loss of 
generality) . 

Consider the intersection point of the copula's zero curve with the curve \I/ 
defined in (2.3). This point can be identified by means of the distribution function H 
(cf. Theorem 2.4). We show below that for 9' > 9, the zero curve of Cg lies strictly 
above the one of Cgi on (0,1). This implies that two different choices of 9 always 
result in two different intersection points Q and thus completes the proof. 

The zero curve of the copula generated by (p is given by t i— > <£ _1 (1 — <p(t)) for 
t € [0, 1]. Let t £ (0, 1). We have that (using Lemma A.l in the penultimate step) 



(f>B'{t) > (pg(t) 
^ (p e ,{t) > (pg{t) - (<pg(0) - w(0)) 

V v ' 

=(1-1)=0 

<Pe(0) - <fe(t) > (pe'{(f>Q^{(pe'{^) - <P0'(t))) 
=> <pe(0) - <pe(t) > <pe(<Pe> 1 (ft8'(. ) ~ &>'(*))) 
<pJ 1 ('P$(0) - (p e {t)) < <py-{<pe>(Q) ~ V>e'(t)). 

This completes the proof. □ 

Example 2.7 The class (4.2.2) from Nelsen (2006), which is given by := {(fg(t) = 
(1 — t) e | 9 £ [1, oo)}, is identifiable by virtue of Corollary 2.6, because <pe(Q) = 1 f° r 
all 9, and (fig(t) is strictly decreasing in 9 for all t £ (0, 1). In fact, Corollary 2.6 can 
be applied successfully to all classes of bounded generators listed in Nelsen (2006, 
Table 4.1). As for most of the remaining classes (strict or not), Theorem 2.4 can be 
applied successfully. The required computations become rather unwieldy, though. 
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2.3. Asymmetric copulas 

So far we have restricted our attention to families of symmetric copulas, which contain 
some of the most widely used classes. In this section, we discuss briefly some families 
of asymmetric copulas. Let us consider a copula density of the form c(x, y) = c{x — y) 
with a 1-periodic function c : M. — > [0, oo) such that f c(x) dx = 1. Alfonsi and Brigo 
(2005) show that this density actually yields a copula C which they call a periodic 
copula. Note that such a periodic copula can be asymmetric (non-exchangeable) 
when c is not an even function. Let us consider two examples in which classes of 
asymmetric copulas are identifiable in the context of the competing risks model in 
the special case where the margins are uniform. The construction of these classes is 
according to Alfonsi and Brigo (2005). 

Example 2.8 (Periodic jump copulas) Suppose that X and Y are uniformly dis- 
tributed on the unit interval. For 7 S (0, 1/2), let c 7 be the periodic continuation on 
R of 7~ 1 l[o 7 ](x) (with < x < 1), denote by C 7 the corresponding periodic copula, 
and let C := {C 7 | 7 € (0, 1/2)}. It is easy to see that 



d 



z 

- A 1. 
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This implies that for two different 7,7' £ (0,1/2), we have that C Xj7 (7 A 7') 7^ 
C 2 -y/(7 A 7'), whence the identifiability of C. 

There are also classes of asymmetric copulas that are not identifiable. We end this 
section with such a negative example. 

Example 2.9 (Generalised Cuadras-Auge family) This class is defined by 

C a)j 3(u,v) 



u 1 a v u a > 



uv 



for < a, (3 < 1 (cf. Nelsen, 2006, p.52ff). The domain of such a copula is divided 
into two parts by the graph of the function fa t /3(x) = x a ^ . In the upper left part 
of the domain, the copula only depends on the parameter j3, in the lower right part 
only on a. Thus, the parameter a is obviously determined by the value of the copula 
at any single point in the lower right part of the domain, and the parameter /3 by 
such a value in the other part of the domain. 

Recall that by virtue of (2.2), the copula's values along the curve ^ defined in (2.3) 
are identifiable based on the distribution of (Z, A). In view of the above discussion 
this means that a copula's parameters a and (3 can be identified if the curve \& 
crosses the graph of the function / Qj( g and, more precisely, there are two points on ^ 
of which we know that they lie on opposite sides of the graph of f a ,i3- Observe that 
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the curve \E r is the graph of the function 

^ : [0, 1] -> [0, 1] : u^F Y (F^(u)) 

and suppose that the marginal distributions Fx and Fy are such that the expression 
log ip(u)/ log u is nowhere constant as a function of u. The latter condition amounts 
to assuming that there is no constant r > such that Fx(t) = Fy{t) r on any interval 
inside [0, 1]. 

Let zGl and u := u(z) := Fx(z). Then, we have that H(z) = C(u,ip(u)) and, 
by definition of the Cuadras-Auge copulas, 

_ fu 1 - a ^(it) u a > ip{uf 
u a < ip(uf. 

Dividing by uip(u), taking the logarithm, and finally dividing by log it, we obtain 

_ log H(z) - log it - log ip(u) _ i-a ip(u) < f a ,p{u) 

{Z) '~ log« " ' > /«,*(«). 

The left hand side of the last equation is identifiable based on the distribution of 
(Z, A) and we may therefore consider it as known. The right hand side is constant 
for tp(u) < f a ,p{u) and nowhere constant otherwise by assumption. Suppose that 
the curve ^ lies on both sides of the graph of f a ^. On the one hand, function T is 
then constant on some intervals where it takes the value —a. On the other hand, 
there are intervals where T is nowhere constant and T(z) log(u)/ log(^(it)) = —(5. 
As log(i/)/log(i/>(ii)) is known, both parameters a and (3 are identifiable in this case. 
To summarise, we obtain the following sufficient identifiability condition: 

A set of parameters A C (0, l) 2 \ {(t,t) | t G (0, 1)} defining a subclass of the 
generalised Cuadras-Auge family is identifiable if 

V(a,fi)eA 3xi,x 2 G(0, 1) : ip(xi) > f a ,p{xi) and tp(x 2 ) < f a ,p(x 2 ) 

and if additionally log ^(it)/ log it is nowhere constant as a function of u. 

If the latter condition is not satisfied, but we know two points on ^ that lie on 
opposite sides of the graph of / QjJ g for every (a, (3) G A, then the parameters are still 
identified, which yields the following condition: 

3 x\,X2 £ (0, 1) V(a,/3)eA : V(^i) > fa,p{xi) and ip(x 2 ) < f a ,p(x 2 )- 
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3. One unknown margin 



In this section, we consider the case where one of the two margins is unknown. 
Our objective is to show that the unknown margin Fx and the copula Cxy can be 
identified simultaneously under certain conditions. To this end, we need stronger 
assumptions than in the previous section, where both margins were entirely known. 
More specifically, we will suppose that the two observed variables 



As far as the family of copulas is concerned, we use the same conditions as in Corol- 
lary 2.6. We show below that in this setting, the copula is identifiable. 

It is noteworthy that in the context of classical survival analysis, i.e. assuming 
independence of the competing risks X and Y, the additional condition (3.1) is 
equivalent to assuming the so-called Koziol-Green model. In this censoring model, 
one assumes that the survival function of the event time X is a power of the survival 
function of the censoring time Y . The Koziol-Green model has been applied success- 
fully in some practical situations. For example, Koziol and Green (1976) and Csorgo 
and Horvath (1981) consider prostate cancer data, whereas Csorgo (1988) treats the 
Channing House data from Hyde (1977). Nevertheless, the classical Koziol-Green 
model has been criticised as unrealistic in many settings, e.g. Csorgo and Faraway 
(1998) called it "too good to be frequently true". Numerous modifications of the orig- 
inal model have been suggested, e.g. the Generalised Koziol-Green model in which 
hypothesis (3.1) is weakened by assuming that A and Z may be dependent, but ac- 
cording to a known copula. It is not within the scope of this paper to contribute to 
the debate on the usefulness of the Koziol-Green model and its modifications. Our 
purpose is rather the extension of the identification results of the previous section to 
the context of dependent competing risks with one unknown margin. 

When X and Y are allowed to be dependent, condition (3.1) is weaker than the 
original "proportional hazards" assumption made by Koziol and Green (1976). Braek- 
ers and Veraverbeke (2008) consider this case and develop a strongly consistent esti- 
mator of Fx in this context, even when Fy is unknown. However, they assume the 
copula Cxy to be known. The next result shows that under assumption (3.1), the 
distribution function Fx and the copula Cxy can be identified simultaneously (the 
latter within certain parametric classes). 

Theorem 3.1 Suppose that A and Z are independent and that Fy is known. Assume 
further that both Fx and Fy are continuous and strictly increasing in (0, oo). As in 
Corollary 2.6, let $ := {tpg \ 9 £ O C M} C ft, be a family of bounded generators 
such that 9 \— > (fe(t) j <p$(Q) is strictly monotonic in 9 for every t £ (0, 1). Then, the 
unknown marginal law Fx and the copulas generated by the class <3? are identifiable. 



A and Z are stochastically independent. 
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Proof. For given z £ (0, 1), the level curve of level z intersects with the curve 
(F x (t),F Y (t)) tm for 



or equivalently at the point 

-i] 



J-l] 



{- ! <p' e (H(s))dH(s,l)\ 



i pi 

v' (H(s))dH(s,l)},<p [ i 1] \- / <p' g (H(s))dH(s,0)}) 
:=(£(*),#(*)). (3.2) 

In order to prove (3.2), we need to show that C(£j(;z), = -2- Since C(x,y) = 

ifg 1 ' (ip$(x) + tpe(y))i this means that we need to show that y?e(^(z)) + ye(Cy C 2 )) = 
<Po(z). Indeed, 



MC(z)) + MC e y (z)) 

= - [ <Pe(H(s)) dH(s, 1) - / <Jo(H(s)) dH{s, 0) 

JH- 1 (z) JH-i^iz) 

<p' e (H(s))dH(s) 

H-Hz) 

1 

ip' e (u)du = ipe(z), 

since <pg(l) = 0. The independence of A and Z implies that there is a constant 
a € [0, 1] such that 

H(s,0) = aH(s) 
H(s,l) = (l-a)H(s), 

and thus, using (3.2), we can write Cyi z ) = <Pq ^ { aL Pe{ z ))- Following the proof of 
Corollary 2.6, we obtain that £^(0) is strictly monotonic in 6 under the assumptions. 

Now let £ := sup{t G [0, 1] | C(F x (t),F Y (t)) = 0}. Note that in view of (2.1), £ is 
identifiable based on the distribution of (Z, A). Furthermore, we have (0) = -Fy (£), 
which then implies the identifiability of 6 using the monotonicity of (0) in 9. Having 
identified 6, we may consider the copula as known. The result then follows applying 
Theorem 3.1 from Zheng and Klein (1995). □ 



13 



4. Discussion 



We have considered the problem of identifiability in a bivariate competing risks 
model with dependence between the two event times X and Y . It is well known that 
without further restrictions on the model, the joint distribution of the event times is 
not identifiable when only the minimum Z of the two times and the indicator A are 
observed. 

First, we showed as an intermediate step that when the marginal distributions 
of X and Y are known, their copula is identifiable within many popular families 
of Archimedean copulas, but also in some less known families. However, in many 
applications at least one of the marginal distributions is the object of interest that 
has to be estimated. 

In the case where only one of the two margins is known, we were still able to 
establish an identification result, but we needed the additional assumption that Z 
and A are stochastically independent. Although this hypothesis can be tested in 
practice, it would be desirable to replace it by a weaker assumption. One could 
imagine that Z and A are dependent via some copula Cza, but it is not obvious in 
which way this copula has to be related to the copula Cxy an d how identifiability 
can be shown in this setting. 

The problem becomes still harder when both margins are completely unknown. 
Without any further assumptions on the margins, the class of all Archimedean copu- 
las is too large as to allow for identification (Wang, 2012), and at present, we do not 
know if identification is possible for certain subfamilies of Archimedean copulas. One 
could try to tackle this problem assuming the margins to lie in parametric classes. 

The generalisation of the results in this paper to the multivariate case should be 
straightforward, at least for the Archimedean families of copulas. Further research is 
needed for the development of estimation methods in the models that we have shown 
to be identifiable. 

A. Appendix 

Lemma A.l If {fe} is a family of monotonic bijective functions fg : [0,1] — > [0,1] 
such that fg(t) is strictly increasing in 9 for every t G (0,1), then fa~ (t) is also 
strictly increasing in 9 for every t G (0, 1). 

Proof. For 9' > 6 and a given t G (0, 1), let z := fg, 1 ^). Then, we have t = fg>(z), 
and consequently 

fff\t) > f e \t) ^ z > f e \fe>(z)) ^ fe(z) < f e/ (z), (A.l) 

which is true by assumption (see Figure 3 for an illustration of the second inequality) . 

□ 
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Figure 3: Illustration of inequality (A.l) 

Acknowledgments 

I. Van Keilegom and M. Schwarz acknowledge financial support from the IAP re- 
search network P7/06 of the Belgian Government (Belgian Science Policy) and from 
the European Research Council under the European Community's Seventh Frame- 
work Programme (FP7/2007-2013) / ERC Grant agreement No. 203650. I. Van Kei- 
legom acknowledges further financial support from the contract "Projet d'Actions de 
Recherche Concertees" (ARC) 11/16-039 of the "Communaute francaise de Belgique", 
granted by the 'Academie universitaire Louvain". 

References 

Alfonsi, A. and Brigo, D. (2005). New families of copulas based on periodic functions. 
Communications in Statistics, Theory and Methods, 34(7): 1437-1447. 

Basu, A. and Ghosh, J. (1978). Identifiability of the multinormal and other distribu- 
tions under competing risks model. Journal of Multivariate Analysis, 8:413-429. 

Braekers, R. and Veraverbeke, N. (2008). A conditional Koziol-Green model under 
dependent censoring. Statistics and Probability Letters, 78 (7): 927-937. 

Cox, D. (1959). The analysis of exponentially distributed life-times with two types 
of failure. Journal of the Royal Statistical Society, Series B, 21:411-421. 

Crowder, M. (1991). On the identifiability crisis in competing risks analysis. Scan- 
dinavian Journal of Statistics, 18(3):223-233. 

Csorgo, S. (1988). Estimation in the proportional hazards model of random censor- 
ship. Statistics, 19(3):437-463. 



15 



Csorgo, S. and Faraway, J. J. (1998). The paradoxical nature of the proportional 
hazards model of random censorship. Statistics, 31(l):67-78. 

Csorgo, S. and Horvath, L. (1981). On the Koziol-Green model for random censor- 
ship. Biometrika, 68:391-401. 

Ebrahimi, N., Molefe, D., and Ying, Z. (2003). Identifiability and censored data. 
Biometrika, 90(3):724-727. 

Gumbel, E. (1960). Bivariate exponential distributions. Journal of the American 
Statistical Association, 55:698-707. 

Hyde, J. (1977). Testing survival under right censoring and left truncation. 
Biometrika, 64:22-30. 

Klein, J. P. and Moeschberger, M. (1988). Bounds on net survival probabilities for 
dependent competing risks. Biometrics, 44(2):529-538. 

Koziol, J. A. and Green, S. B. (1976). A Cramer-von Mises statistic for randomly 
censored data. Biometrika, 63:465-474. 

Marshall, A. and Olkin, I. (1967). A multivariate exponential distribution. Journal 
of the American Statistical Association, 62:30-44. 

Nelsen, R. B. (2006). An introduction to copulas. Springer Series in Statistics, second 
edition. 

Rivest, L.-P. and Wells, M. T. (2001). A martingale approach to the copula-graphic 
estimator for the survival function under dependent censoring. Journal of Multi- 
variate Analysis, 79(1):138-155. 

Slud, E. V. and Rubinstein, L. V. (1983). Dependent competing risks and summary 
survival curves. Biometrika, 70:643-649. 

Tsiatis, A. (1975). A nonidentifiability aspect of the problem of competing risks. 
Proceedings of the National Academy of Sciences USA, 72(l):20-22. 

Wang, A. (2012). On the nonidentifiability property of archimedean copula models 
under dependent censoring. Statistics and Probability Letters, 82(3):621 - 625. 

Wysocki, W. (2012). Constructing archimedean copulas from diagonal sections. 
Statistics and Probability Letters, 82(4):818 - 826. 

Zheng, M. and Klein, J. P. (1995). Estimates of marginal survival for dependent 
competing risks based on an assumed copula. Biometrika, 82(1):127-138. 



16 



